Feature Selection with Exception Handling Using Adaptive Distance Measures -an Example from Phonetics
نویسنده
چکیده
The goal in this paper is to show how the classiication of patterns of phonetic features (=phones) to phonemes can be acquired. This classi-cational process is modelled by a supervised feature selection method, based on a weighted Hamming distance, augmented by Boolean functions describing exceptions. An important aspect is the diierentiation of rules and exceptions during learning. 1 Phonetic features and Phonemes The goal in this paper is to show how the classiication of patterns of pho-netic features (=phones) to phonemes can be acquired. In every language a number of diierentiable phones belong to one phoneme. To learn the phonemic pattern of a language amounts to learn a classiication of all naturally occurring phones to a phoneme. The continuum of articulatory places or acoustically deened frequency formants for a single phone can be cut up into a set of descriptive features. Phones are then represented as in IPA-notation, where a single symbol is used for each naturally attested occurence of a bundle of phonetic features, or directly by a number of phonetic segmental features. Phonetic features for German vowels, the corresponding patterns and their intended classiication are shown in Table 1, adapted from Hym75]. (Non-segmental features such as length,syllabicity, nasalization, rhotaciza-tion, diphthongs have been excluded from this analysis.) As has been noted by phonologists for a long time ((Tru58]), the grouping of phones to phonemes gives rise to phonological systems, organized 1
منابع مشابه
Feature Selection Using Multi Objective Genetic Algorithm with Support Vector Machine
Different approaches have been proposed for feature selection to obtain suitable features subset among all features. These methods search feature space for feature subsets which satisfies some criteria or optimizes several objective functions. The objective functions are divided into two main groups: filter and wrapper methods. In filter methods, features subsets are selected due to some measu...
متن کاملParallel Visual Information Retrieval in VizIR
This paper describes how parallel retrieval is implemented in the content-based visual information retrieval framework VizIR. Generally, two major use cases for parallelisation exist in visual retrieval systems: distributed querying and simultaneous multi-user querying. Distributed querying includes parallel query execution and querying multiple databases. Content-based querying is a two-step p...
متن کاملA New Hybrid Feature Subset Selection Algorithm for the Analysis of Ovarian Cancer Data Using Laser Mass Spectrum
Introduction: Amajor problem in the treatment of cancer is the lack of an appropriate method for the early diagnosis of the disease. The chemical reaction within an organ may be reflected in the form of proteomic patterns in the serum, sputum, or urine. Laser mass spectrometry is a valuable tool for extracting the proteomic patterns from biological samples. A major challenge in extracting such ...
متن کاملIFSB-ReliefF: A New Instance and Feature Selection Algorithm Based on ReliefF
Increasing the use of Internet and some phenomena such as sensor networks has led to an unnecessary increasing the volume of information. Though it has many benefits, it causes problems such as storage space requirements and better processors, as well as data refinement to remove unnecessary data. Data reduction methods provide ways to select useful data from a large amount of duplicate, incomp...
متن کاملOnline Streaming Feature Selection Using Geometric Series of the Adjacency Matrix of Features
Feature Selection (FS) is an important pre-processing step in machine learning and data mining. All the traditional feature selection methods assume that the entire feature space is available from the beginning. However, online streaming features (OSF) are an integral part of many real-world applications. In OSF, the number of training examples is fixed while the number of features grows with t...
متن کامل